Feature/kimi k25 pp support by chun-wan · Pull Request #2 · MHYangAMD/sglang

chun-wan · 2026-02-10T14:30:26Z

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
After green CI and required approvals, ask Merge Oncalls to merge.

- Add INT4 W4A16 MoE config for E=384,N=128 (Kimi K2.5) - Add FP8 W8A8 configs for AMD Instinct MI300X - Update fused_moe_triton layer for W4A16 support - Update compressed_tensors_moe for INT4 quantization - Add HIP kernels for ROCm support

Feature: Support torch compile for Kimi-K2.5

Changes: - kimi_k25.py: Add pp_proxy_tensors parameter to forward method and pass it to general_mm_embed_routine - deepseek_v2.py: Fix device acquisition for non-first PP rank when input_embeds and input_ids are both None Tested with TP=4, PP=2 configuration: - Prefill test: 10 requests, 100% success - Decode test: 20 requests, 100% success

chun-wan and others added 5 commits February 7, 2026 21:23

Update scale layout

5865bd7

Feature: Support torch compile for Kimi-K2.5

563df5a

Merge pull request sgl-project#1 from bobofang11235/dev/torch_compile

4179fdb

Feature: Support torch compile for Kimi-K2.5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/kimi k25 pp support#2

Feature/kimi k25 pp support#2
chun-wan wants to merge 5 commits intoMHYangAMD:mainfrom
chun-wan:feature/kimi-k25-pp-support

chun-wan commented Feb 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chun-wan commented Feb 10, 2026

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants